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ABSTRACT 

Current investigation of RNA transcriptomes relies heavily on the use of retroviral reverse transcriptases. It is well known that 
these enzymes have many limitations because of their intrinsic properties. This commentary highlights the recent biochemical 
characterization of a new family of reverse transcriptases, those encoded by group II intron retrohoming elements. The novel 
properties of these enzymes endow them with the potential to revolutionize how we approach RNA analyses. 

Keywords: RNA-seq; noncoding RNA; reverse transcriptase; template switching 



In biology, even hallowed rules like the central dogma have ex- 
ceptions. Parallel to evolutionary diversification of the DNA- 
templated polymerases necessary for genome replication and 
expression, RNA-templated DNA and RNA polymerases are 
also evolutionary diverse and widespread (Ng et al. 2008; 
Finnegan 2012). Among the polymerases that counter the cen- 
tral dogma flow, the best known are the reverse transcriptases 
(RTs). The initial discoveries of retrovirally encoded RT activ- 
ity and function were recognized with the 1 975 Nobel Prize in 
Physiology or Medicine. Recombinant retroviral RTs and their 
laboratory derivatives have been crucial research tools for mo- 
lecular biologists for decades. 

RTs other than retroviral family members are less well un- 
derstood in structure or enzymology and have not yet been 
harnessed for commercial or medical diagnostic applications. 
One example is telomerase RT (Blackburn and Collins 201 1 ), 
the biological significance of which was also recognized by the 
Nobel Prize in Physiology or Medicine in 2009. In addition to 
the retroviral and cellular RTs that maintain their respective 
genomes, RTs are also exploited by selfish DNA elements for 
their perpetuation. Genome-embedded retrotransposons en- 
code RTs required for element mobility. In humans, non-LTR 
retrotransposon RTs copy polyadenosine-tailed templates to 
insert complementary DNA (cDNA) at nonspecific target 
sites, often with 5' truncation (Finnegan 2012). In contrast, 
mobile group II introns encode RTs that copy the intron 
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RNA to insert a precise full-length cDNA into an intronless al- 
lele of the host gene (Lambowitz and Zimmerly 2004). This 
process, termed retrohoming, must maintain the host gene 
exon sequences as precisely as the intron sequence to allow 
functional transcript production by intron splicing. Genome 
projects have unearthed hundreds of group II introns, mainly 
in prokaryotes and the organellar genomes of fungi and plants 
(Lambowitz and Zimmerly 2011). 

Group II intron RTs synthesize long cDNAs with high fidel- 
ity, but they have remained untapped as a source of RTs for 
biotechnology applications. Group II intron RTs associate 
tightly with their coevolved intron RNA templates, like non- 
LTR retroelement RTs and telomerase. This stymies the po- 
tential commercial applications of these RTs in copying heter- 
ologous RNA templates. Furthermore, due to a physiological 
dependence on protein and RNA cofolding, RTs other than 
retroviral family members have not been amenable to recom- 
binant protein expression. After many years of groundwork, 
the Lambowitz group reports in RNA a landmark study attain- 
ing robust high-level recombinant expression of group II 
intron RTs, focusing on thermostable enzymes from the 
bacterial thermophiles Thermosynechococcus elongatus and 
Geobacillus stearothermophilus (Mohr et al. 2013). The meth- 
ods used to produce these thermostable group II intron RTs 
are generalizable to the group II intron RTs from other evolu- 
tionary branches of the tree of life. 

As would be predicted from their biological function, group 
II intron RTs have higher processivity, fidelity, and thermosta- 
bility than retroviral RTs (Mohr et al. 2013). Retroviral RTs 
copy their own genomes, or heterologous templates, with 
high error rates relative to cellular DNA polymerases as an 
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evolutionarily optimal mechanism for virus evolution (Duffy 
et al. 2008). In contrast, retrohoming group II intron RTs 
must synthesize an accurate cDNA copy of the intron RNA, 
which can be quite long and also must be folded into the stable 
secondary and tertiary structures that support self-splicing. 
Thus, by evolutionary selection, group II intron RTs must 
have high processivity of cDNA synthesis on challenging tem- 
plate RNAs and also have high cDNA synthesis fidelity. 

Even without optimization with rounds of structure-guid- 
ed design and genetic selection, group II intron RTs have ma- 
jor advantages for RT applications, including quantitative RT- 
PCR and next- generation RNA sequencing. For example, 
thermostability of the group II intron RTs allows cDNA syn- 
thesis at temperatures that denature template RNA secondary 
structure, and their high processivity enables the synthesis of 
long cDNAs with minimal background from premature ter- 
mination (Mohr et al. 2013). Perhaps the most striking and 
unanticipated property of the group II intron RTs is their abil- 
ity to perform end-to-end template switching. Mohr and col- 
leagues (Mohr et al. 2013) demonstrate that a DNA/RNA 
hybrid "primer" with a single 3' nucleotide DNA overhang 
can, with high efficiency, form a base pair with the 3' terminus 
of any target RNA. The DNA component of the "primer" is 
then extended by group II intron RT synthesis across the en- 
tire target RNA template. This method has many new applica- 
tions and advantages over retroviral RTs for cDNA synthesis 
and library construction. For example, this approach elimi- 
nates the cumbersome, often inefficient, highly sequence-bi- 
ased RNA ligation step common to many library construction 
kits. Moreover, this approach can be used to obtain cDNAs for 
otherwise "unclonable" RNAs with a modified or structured 
3' end. It will be of considerable interest to investigate what 
types of RNAs have remained hidden from identification 
simply because they are not captured with current cloning 
methodologies. 

Remarkable new feats of cDNA synthesis are described in 
this initial RNA article, but, of course, myriad others are pos- 



sible to envision. Based on our experiences chairing recent 
National Institutes of Health Common Fund study sections, 
the surging interest in extracellular RNA profiling as a diag- 
nostic would benefit from less-biased cloning of structured 
RNAs as well as processed short microRNAs in serum samples. 
Also, unlike the retroviral RTs, group II intron RTs are selec- 
tive for RNA versus DNA templates, a specificity that will 
greatly reduce complications arising from DNA contamina- 
tion of RNA preparations for cDNA library construction. 
Starting from cross-linked chromatin fragments, group II in- 
tron RT cDNAs could be joined to nearby genomic DNA to 
obtain an unbiased genome-wide map of which noncoding 
RNAs are associated with which DNA loci. 

Clearly, applications using recombinant group II intron 
RTs have enormous promise for the improvement of human 
health. It is notable that, against the wisdom of current fund- 
ing trends, the National Institutes of Health made possible 
these applications by funding basic research on the mecha- 
nisms of DNA mobility in expedient, nonmodel organisms. 
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